Scheduling  Constrained-Deadline  Parallel  Tasks  on 
Two-type  Heterogeneous  Multiprocessors 

Bjorn  Andersson  Gurulingesh  Raravi 

Carnegie  Mellon  University  Xerox  Research  Centre  India 


Abstract — Consider  the  problem  of  scheduling  a  taskset 
on  a  multiprocessor  so  that  all  deadlines  are  met.  Assume 
(i)  constrained-deadline  sporadic  tasks,  i.e.,  a  task  generates  a 
sequence  of  jobs  and  the  deadline  of  a  job  is  no  greater  than 
the  minimum  inter-arrival  time  of  the  task  that  generates  the 
job,  (ii)  stage-parallelism,  i.e.,  a  task  comprises  one  or  more 
stages  with  a  stage  comprising  one  or  many  segments  so  that 
segments  in  the  same  stage  are  allowed  to  execute  in  parallel  and 
a  segment  is  allowed  to  execute  only  if  all  segments  of  the  previous 
stage  have  finished,  (iii)  two-type  heterogeneous  multiprocessor 
platform,  i.e.,  there  are  processors  of  two  types,  type-1  and  type- 
2,  and  for  each  task,  there  is  a  specification  of  its  execution  speed 
on  a  type-1  processor  and  on  a  type-2  processor,  and  (iv)  intra¬ 
type  migration,  i.e.,  a  job  can  migrate  between  processors  of  the 
same  type  but  for  a  task,  all  jobs  of  this  task  must  execute  on  the 
same  processor  type.  We  present  an  algorithm  for  this  problem;  it 
assigns  each  task  to  a  processor  type  and  then  schedules  tasks  on 
processors  of  each  type  with  global-Earliest-Deadline-First.  This 
algorithm  has  pseudo-polynomial  time  complexity  and  speedup 
factor  at  most  5.  This  is  the  first  algorithm  for  scheduling  parallel 
real-time  tasks  on  a  heterogeneous  multiprocessor  with  provably 
good  performance. 

I.  Introduction 

Software  systems  are  expected  to  do  more  with  less,  i.e., 
providing  more  functionality  and  greater  performance  with 
lower  size,  weight,  and  power  consumption.  The  real-time 
systems  community  has  taken  a  great  interest  in  developing 
methods  that  provide  foundations  for  doing  so  while  ensuring, 
before  run-time,  that  the  software  system  can  respond,  at  run¬ 
time,  to  certain  events  within  pre-specified  time  constraints 
(deadlines).  Such  foundations  include  algorithms  for  assign¬ 
ing  tasks  to  heterogeneous  multiprocessors.  These  algorithms 
are  useful  because  heterogeneous  multiprocessors  typically 
provide  more  processing  power  per  watt.  Other  foundations 
include  algorithms  for  scheduling  tasks  that  can  execute  in 
parallel  on  multiprocessors.  These  algorithms  are  relevant  for 
computations  responding  to  events  that  have  so  tight  deadlines 
that  even  if  a  computation  is  executed  on  a  system  with  no 
other  computations  present,  the  only  way  for  the  computation 
to  meet  its  deadline  is  to  perform  execution  in  parallel. 

A  computer  platform  is  a  homogeneous  multiprocessor 
(sometimes  called  identical  multiprocessor )  if  the  execution 
speed  of  all  tasks  is  the  same  on  all  processors.  Conversely,  a 
computer  platform  is  a  heterogeneous  multiprocessor  (some¬ 
times  called  unrelated  multiprocessor )  if  the  execution  speed 
of  a  task  depends  on  both  the  processor  and  the  task.  A 
heterogeneous  multiprocessor  is  two-type  if  it  has  two  types 


of  processors  (a.k.a  two-type  platform ).  Analogously,  a  hetero¬ 
geneous  multiprocessor  is  t-type  if  it  has  t  types  of  processors 
(a.k.a  t-type  platform ).  For  two-type  platforms,  the  problem  of 
assigning  tasks  to  processors  is  NP-hard  in  the  strong  sense 
and  the  problem  of  assigning  tasks  to  processor  types  is  NP- 
hard  [1],  For  t-type  platforms,  both  the  problems  are  NP-Hard 
in  the  strong  sense  [2],  [3],  Consequently,  the  research  commu¬ 
nity  has  developed  approximation  algorithms  (i.e.,  algorithms 
with  finite  speedup  factors)  for  assigning  tasks  to  processors 
and  to  processor  types  [1]— [15]  on  such  platforms. 

Related  work.  The  algorithms  for  assigning  implicit- 
deadline  sporadic  tasks  (i.e.,  a  task  generates  a  sequence  of 
jobs  and  a  job  has  a  deadline  that  is  equal  to  the  minimum 
inter-arrival  time  of  the  task  that  generates  the  job)  to  proces¬ 
sors  and  to  processor  types  for  two-type  platforms  [1],  [5]— [8] 
have  lower  time  complexity  than  the  algorithms  for  t-type  plat¬ 
forms  [2]-[4],  [9]— [  14]  while  maintaining  their  performance 
bound.  In  addition,  an  algorithm  for  scheduling  arbitrary- 
deadline  sporadic  tasks  (i.e.,  a  task  generates  a  sequence 
of  jobs  and  a  job  has  a  deadline  that  may  be  less  than  or 
greater  than  or  equal  to  the  minimum  inter-arrival  time  of 
the  task  that  generates  the  job)  on  t-type  platforms  is  known 
as  well  [15].  However,  they  [1]— [15]  do  not  support  parallel 
tasks.  The  research  community  has  also  presented  algorithms 
with  proven  speedup  factors  for  scheduling  parallel  tasks  on 
homogeneous  multiprocessors  [  16]— [22],  Further,  there  are 
other  algorithms  [23]— [33]  with  no  proven  speedup  factors 
for  scheduling  parallel  tasks  on  homogeneous  multiprocessors 
—  some  of  them  [23]— [28]  are  for  constrained-deadline  tasks 
(i.e.,  a  task  generates  a  sequence  of  jobs  and  a  job  has  a  dead¬ 
line  that  may  be  less  than  or  equal  to  the  minimum  inter-arrival 
time  of  the  task  that  generates  the  job)  and  the  others  [29]— [33] 
are  for  implicit-deadline  tasks.  Unfortunately,  none  of  these 
works  [  16]— [33]  support  heterogeneous  multiprocessors  (and 
moreover  most  of  these  algorithms  [23]— [33]  have  no  proven 
speedup  factors).  A  work  by  Holenderski  et.  al.  [34]  comes 
closest  to  ours  as  it  also  deals  with  the  problem  of  scheduling 
parallel  tasks  on  heterogeneous  multiprocessors.  However,  the 
approach  presented  in  [34]  has  no  proven  speedup  factor. 

This  research.  In  this  paper,  we  present  a  pseudo¬ 
polynomial  algorithm  for  scheduling  constrained-deadline  par¬ 
allel  tasks  on  a  two-type  heterogeneous  multiprocessor  and 
prove  its  speedup  factor.  Our  approach  assigns  each  task 
to  a  processor  type  and  then  uses  global-Earliest-Deadline- 
First  (gEDF)  on  the  processors  of  each  type  to  schedule  the 


1 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1 .  REPORT  DATE  2.  REPORT  TYPE 

13  JAN  2015  N/A 

3.  DATES  COVERED 

4.  TITLE  AND  SUBTITLE 

Scheduling  Constrained-Deadline  Parallel  Tasks  on  Two-type 
Heterogeneous  Multiprocessors 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

Raravi  /Bjorn  A.  Andersson  Gurulingesh 

5d.  PROIECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Software  Engineering  Institute  Carnegie  Mellon  University  Pittsburgh, 

PA  15213 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release,  distribution  unlimited. 

13.  SUPPLEMENTARY  NOTES 

The  original  document  contains  color  images. 

14.  ABSTRACT 

15.  SUBIECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF:  17.  LIMITATION  OF 

18.  NUMBER  19a.  NAME  OF 

a.  REPORT  b.  ABSTRACT  c.  THIS  PAGE  S  AR 

unclassified  unclassified  unclassified 

13 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


stage:  1  stage:  2 


stage:  nSj 
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segments:  nsegj  x  segments:  nsegj  2  segments:  nsegy  segments:  nsegj  ns 

Fig.  1:  The  parallel  task  model  studied  in  this  paper. 


respective  tasks.  We  show  that  our  new  algorithm  has  pseudo¬ 
polynomial  time  complexity  and  speedup  factor  at  most  5.  We 
study  the  constrained-deadline  sporadic  task  model  and  we 
consider  parallelism  with  stages  (i.e.,  a  task  is  described  with 
one  or  more  stages  with  each  stage  comprising  one  or  many 
segments  such  that  segments  in  the  same  stage  are  allowed  to 
execute  in  parallel  but  a  segment  is  only  allowed  to  execute 
if  all  segments  of  the  previous  stage  have  finished  execution). 
This  work  makes  the  following  contribution:  it  presents  the 
first  algorithm  for  scheduling  parallel  real-time  tasks  on  a 
heterogeneous  multiprocessor  with  a  proven  speedup  factor. 

Organization  of  the  paper.  The  rest  of  this  paper  is  orga¬ 
nized  as  follows.  Section  II  states  the  system  model.  Section  III 
lists  previous  results  on  parallel  scheduling  on  homogeneous 
multiprocessors  and  also  proves  new  lemmas  that  we  use  later 
in  the  paper.  Section  IV  presents  our  new  algorithm  for  two- 
type  heterogeneous  multiprocessors  and  proves  its  speedup 
factor  and  time  complexity.  Section  V  concludes. 

II.  System  Model 

We  consider  the  problem  of  scheduling  a  set  r  of 
constrained-deadline  sporadic  tasks  on  a  two-type  heteroge¬ 
neous  multiprocessor  platform  II  comprising  mi  processors 
of  type-1  and  m 2  processors  of  type-2.  A  task  t,  £  r 
is  characterized  by  a  minimum  inter-arrival  time  Tt  and  a 
deadline  Di  such  that  Di  <  T:.  Each  task  t,  generates  a 
sequence  of  jobs ,  with  the  first  job  arriving  at  any  time  and 
subsequent  jobs  arriving  at  least  T,;  time  units  apart. 

The  execution  of  a  task  r,;  is  described  by  ns,;,  nseg,  and 
Ci j  with  the  interpretation  that  a  job  of  r,;  has  ns^  stages 
with  stage  j  comprising  nseg.^  3  segments  with  each  segment 
of  stage  j  having  execution  requirement  at  most  C,  :/  —  see 
Fig.  1 .  A  segment  finishes  when  it  performs  a  number  of  units 
of  execution  equal  to  its  execution  requirement.  A  segment 
executing  contiguously  for  L  time  units  on  a  processor  of 
speed  s  performs  L  x  s  units  of  execution.  A  segment  of  a 
job  is  allowed  to  execute  only  if  all  segments  of  its  previous 
stage  have  finished.  A  job  finishes  when  all  segments  of  its 
last  stage  have  finished.  If  a  job  of  t,  finishes  at  most  l),  time 
units  after  its  arrival,  then  it  meets  its  deadline. 

On  a  two-type  platform,  the  execution  speed  of  a  job 
depends  on  the  type  of  processor  on  which  it  executes.  Let  rj 


and  if  denote  the  execution  speeds  of  a  job  of  task  t,  when 
it  executes  on  a  processor  of  type-1  and  type-2  respectively. 
We  now  define  terms  that  we  use  in  the  rest  of  the  paper. 

Definition  1  (Legal  jobset).  If  for  each  task  in  the  taskset  t, 
the  task  is  assigned  the  number  of  jobs  it  generates  and  each 
job  is  assigned  an  arrival  time  such  that  the  minimum  inter¬ 
arrival  time  constraint  is  satisfied  and  each  segment  of  a  job  is 
assigned  an  execution  requirement  such  that  the  upper  bound 
on  execution  requirement  of  a  segment  is  respected,  then  we 
say  that  the  resulting  jobset  is  a  legal  jobset  with  respect  to  r. 

Definition  2  (Intra-migrative  schedule).  A  schedule  is  intra- 
migrative  if  both  of  the  following  conditions  are  true:  (i)  jobs 
are  allowed  to  migrate  between  processors  of  the  same  type 
and  (ii)  for  each  task,  it  holds  that,  if  a  job  executes  on  a 
processor  of  one  type  then  all  other  jobs  of  this  task  execute 
on  processors  of  the  same  type. 

Definition  3  (Intra-migrative  feasible  taskset).  A  taskset  r  is 
ultra-migrative  feasible  on  a  two-type  platform  II  if  for  each 
jobset  that  is  legal  with  respect  to  t  there  exists  an  intra- 
migrative  schedule  in  which  all  deadlines  are  met. 

Definition  4  (S'-Schedulable  task  set).  A  taskset  r  is  S- 
schedulable  on  a  two-type  platform  II  if  for  each  jobset  that  is 
legal  with  respect  to  t,  for  each  schedule  that  S  can  generate 
from  the  jobset,  it  holds  that  the  schedule  is  intra-migrative 
and  all  deadlines  are  met. 

Definition  5  (Speed  of  the  computing  platform).  If  II  is  a 

two-type  platform  then  let  H  X  x  denote  a  two-type  platform 
where  the  speed  of  each  processor  is  multiplied  by  x. 

Definition  6  (Speedup  factor).  A  scheduler  S  has  a  speedup 
factor  SF g  if,  for  each  taskset  t,  for  each  two-type  platform 

II,  it  holds  that:  if  t  is  intra-migrative  feasible  on  II  then  t 
is  S-schedulable  on  II  x  SFg. 

In  order  to  simplify  our  discussion  in  the  rest  of  the 
paper,  we  rewrite  our  model  to  an  equivalent  formulation  as 
follows.  Instead  of  using  rj,  and  rj,  we  use  parameters 
C]  ? ,  Cf  j,  and  s  selected  as  follows:  Cjj/s  =  Cij/rj  and 
Cfj/s  =  We  let  s.t.  mean  such  that  and  :  mean  it 

holds  that.  We  let  {x\f(x)}  denote  a  set  of  elements  so  that 
an  element  x  is  in  the  set  if  and  only  if  f(x)  is  true.  For 
convenience,  we  write  the  predicate  (Vi  >  0  :  x)  to  mean 
the  predicate  (Vi  s.t.  i  >  0  :  x).  For  convenience,  we  also 
define  DMAX  =  maxTi6r  Dt,  DMIN  =  min T.&T  Di,  and 
TMAX  =  maxTigT  Tj. 

III.  SCHEDUL ABILITY  ANALYSIS  OF  PARALLEL  TASKS  ON 

a  Homogeneous  Multiprocessor 

It  is  known  that  there  is  no  optimal  online  algorithm  for 
scheduling  constrained-deadline  sporadic  tasks  on  a  homoge¬ 
neous  multiprocessor  (even  for  tasks  without  parallelism)  [35]. 
Therefore,  in  this  paper,  we  use  global-Earliest-Deadline-First 
(gEDF)  scheduling  policy  as  it  has  a  good  speedup  factor  [36]. 
There  is  a  brute-force  approach  [37]  which  provides  exact 
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Fig.  2:  Previously  known  [18]  schedulability  analysis  for  gEDF  scheduling  of  parallel  tasks  on  a  homogeneous  multiprocessor. 
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Fig.  3:  New  expressions  we  will  use. 


schedulability  test  for  gEDF  but  it  has  a  very  large  time- 
complexity  and  it  does  not  support  parallel  tasks  and  it  requires 
that  tasks  parameters  are  integers.  Therefore,  in  this  paper,  we 
use  a  sufficient  (not  exact)  schedulability  test  for  parallel  tasks. 

The  research  literature  provides  many  sufficient  schedula¬ 
bility  tests  for  gEDF  for  tasks  that  are  not  parallel  —  see 
for  example  [38]— [40] .  Of  particular  interest  is  [40]  which 
provides  a  schedulability  test  with  a  speedup  factor  of  two. 
This  schedulability  test  states  that  if  there  exists  a  cr  such 
that  cr  is  at  least  as  large  as  the  density  of  each  task  and 
if  it  holds  for  each  value  of  t  that  the  sum  of  ffdbf  of 
tasks  is  at  most  a  certain  value  then  the  taskset  is  gEDF- 
schedulable.  Here  ffdbf  means  forced-forward  demand-bound 
function;  it  is  a  function  which  describes  the  maximum  amount 
of  execution  a  given  task  can  demand  in  a  time  interval  of 


duration  t.  Later  work  [18]  extended  this  for  parallel  tasks 
and  this  was  done  by  defining  ffdbf  for  parallel  tasks.  Fig.  2 
shows  this  schedulability  test  (see  Eq.  (6))  for  parallel  tasks  on 
a  homogeneous  multiprocessor  [18]  comprising  m  processors. 
It  also  shows  a  feasibility  test  (see  Eq.  (7))  for  parallel  tasks 
on  a  homogeneous  multiprocessor.  Since  this  formulation  is 
for  homogeneous  multiprocessors,  we  do  not  have  the  1  and 
2  on  Cij.  Some  basic  properties  of  ffdbf  are  shown  below. 

Lemma  1.  Vfo  >  0 ,Vt  >  to  :  ffdbf  (r*,  to,  v,  s)  < 
ffdbf  (■ Ti,t,v,s ) 

Proof:  Follows  from  inspection  of  terms  in  Eqs.  (l)-(4). 


Lemma  2.  \fl  G  Z  :  ffdbf  (r*,  t  +  1  x  T*,  v,  s)  = 
ffdbf  (r»,  t,v,s)  +  l  x  Ci 
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Fig.  4:  A  naive  formulation  of  constraints  for  task-to-processor-type  assignment. 


Proof:  Follows  from  Eq.  (4).  ■ 

Let  mi(r),  in  Fig.  3,  be  a  duration  of  a  time  interval. 

Lemma  3.  ffdbf  (77,  mi(r),  v,  s)  +  Ci  < 
ffdbf  (77, 2  x  mi(r),  v,  s) 

Proof:  Applying  Lemma  2  with  t  =  mi(r)  and  1  =  1, 
yields  ffdbf  (77,  mi(r),  v,  s)+C,  =  ffdbf  (77,  mi(r)  +  T*,  u,  s). 
From  the  definition  of  rni(r),  it  follows  that  Ti  <  rni(r). 
Applying  this  on  the  above  and  using  Lemma  1  yields 
ffdbf  (ri,  mi(r),  v,  s)  +  Ci  =  ffdbf  (77,  2  x  mi(r),  i>,  s).  This 
states  the  lemma.  ■ 

We  will  now  introduce  a  function  ffdbf*  (used  to  over¬ 
approximate  ffdbf)  such  that  for  inputs  where  t  is  less  than 
or  equal  to  mi(r),  it  holds  that  ffdbf*  (77,  t,  v,  s,  r)  is  a 
staircase  function  and  for  t  greater  than  mi(r),  it  holds  that 
ffdbf*  (77,  t,  v,  s,  t)  increases  linearly  with  t.  Formally,  Eq.  (9) 
in  Fig.  3  shows  the  definition  of  ffdbf*  (ri,t,v,  s,t). 

Lemma  4.  Vt  >  0  :  ffdbf  (77,  t,  v,  s)  <  ffdbf*  (77,  t,  v,  s,  r) 

Proof:  See  Appendix.  ■ 

Definition  7.  TS(t,  0)  d=  {t\(2^  *\  =  f)  A  (DMIN  x  (1  - 
6)  <t<  mi  (r))}  U  {2l>g2pMiNx(i-0))j } 

Lemma  5.  Vt  £  TS(t,6)  :  ffdbf*  (Ti,  t,  v,  s,  r)  = 

ffdbf  (^,  2  x  t,v,s) 

Proof:  Follows  from  definition  of  ffdbf*  and  Definition  7. 

■ 

IV.  New  scheduling  algorithm  and  speedup  factor 

In  this  section,  we  discuss  scheduling  on  a  two-type  hetero¬ 
geneous  multiprocessor.  We  will  use  notations  in  Fig.  2  and 
Fig.  3  but  with  1  as  superscript;  this  superscript  indicates  that 
the  quantity  if  based  on  C]  .■ .  Ditto  for  type-2.  For  example, 

from  Eq.  (1)  we  obtain:  Cj  '=  ^7=1  (nsegij  x  ^Ij)  anc^ 
Cf  d=  E"=  1  (nsegij  x  Cfd). 

A.  Developing  the  new  algorithm 

The  problem  of  scheduling  constrained-deadline  parallel 
sporadic  tasks  on  a  two-type  heterogeneous  multiprocessor, 
can  be  solved  in  two  steps.  Step  1:  Before  run-time,  assign 
tasks  to  processor  types  so  that  (i)  tasks  assigned  to  type-1  are 
gEDF-schedulable  on  the  processors  of  type-1  and  (ii)  tasks 
assigned  to  type-2  are  gEDF-schedulable  on  the  processors 
of  type-2.  Step  2:  At  run-time,  schedule  all  tasks  assigned  to 
type-1  with  gEDF  on  processors  of  type-1  and  schedule  all 
tasks  assigned  to  type-2  with  gEDF  on  processors  of  type-2. 


Since  Step  2  is  straightforward,  we  focus  our  discussion  on 
Step  1. 

Step  1  could  be  solved  as  follows.  Let  x]  =  1  indicate  that 
task  Ti  is  assigned  to  type-1  processors  and  let  xf  =  1  indicate 
that  task  77  is  assigned  to  type-2  processors.  Let  X  denote  the 
matrix  of  xt  values  for  all  tasks  in  r.  Then,  by  using  Eq.  (6), 
one  could  solve  Step  1  by  assigning  values  to  Xi  variables  such 
that  all  the  constraints  in  Fig.  4  are  satisfied.  Intuitively,  Cl  in 
Fig.  4  states  that  according  to  the  schedulability  test  of  Eq.  (6), 
the  tasks  assigned  to  type-1  processors  are  gEDF-schedulable 
on  type-1  processors.  C2  is  analogous  for  type-2  processors. 
C3  combined  with  C4  states  that  a  task  is  either  assigned 
to  type-1  or  type-2.  C4  states  that  ^-variables  are  integers. 
Unfortunately,  creating  an  algorithm  that  assigns  values  to  Xi 
such  that  all  the  constraints  in  Fig.  4  are  satisfied  is  challenging 
because  (i)  it  involves  an  exists-quantifier  (Da1  in  Cl  and  Da2 
in  C2)  and  (ii)  it  involves  a  forall-quantifier  (Vt  in  Cl  and  Vt 
in  C2)  and  (iii)  it  has  integer  variables.  Hence,  we  will  now 
present  other  constraints  so  that  if  these  other  constraints  are 
satisfied  then  the  constraints  in  Fig.  4  are  satisfied  as  well. 

Let  6\  and  62  be  non-negative  parameters  that  we  can 
choose.  Then,  instead  of  asking  if  there  exists  a  a1  in  Cl 
in  Fig.  4  with  certain  properties,  let  us  only  consider  those  cr1 
such  that  cr1/s  =  6\.  Then  it  follows  that  if  there  is  a  task  77 

with  x)  =  1  and  jf  >  6 1  x  s(n)  then  Cl  is  violated.  Hence, 

if  9\  is  given  and  o1  / s  =  9\  and  if  jf  >  9\  x  s(n)  then  it 
follows  that  a  necessary  condition  to  satisfy  the  constraints  in 
Fig.  4  is  that  xj  =0.  We  can  reason  analogously  for  and 
C2.  For  this  reason,  we  introduce  the  following  sets. 

1  2 

H12  d=  {ner|(^->flix  s(n))  A(rjf>e2x  s(n))}  uo) 
1  2 

Hld=  {r,6r  (^-  <  6>i  x  s(n))  A  (^-  >  02  x  s(n))}  (11) 

J-'i  C'i 

1  2 

H2  d=f  (n  6  r  >  9i  X  s( n))  A  (^-  <  02  X  s(n))}  (12) 

J-Ji  J-Ji 

1  2 

l  d=  {Tier  1  (^  <  01  X  S(n))  a  (^-  <  e2  x  s(n))}  (13) 

Observe  that  r  =  H12UH1UH2UL.  We  let  H12(0i,02,t) 
denote  H12  for  the  parameters  0i,02,t.  Analogously  for 
H1,H2,  and  L. 

Clearly,  if  9\  and  02  are  given  and  a1  / s  =  9\  and  a2  / s  =  02 
and  if  there  is  a  task  in  H12  then  it  is  impossible  to  satisfy 
the  constraints  in  Fig.  4.  Also,  if  9\  is  given  and  cr1/s  =  9\ 
then  a  necessary  condition  to  satisfy  the  constraints  in  Fig.  4 
is  to  set,  for  each  task  77  €  HI,  x\  =  1.  Analogously,  if  02  is 
given  and  a2 / s  =  02  then  a  necessary  condition  to  satisfy  the 
constraints  in  Fig.  4  is  to  set,  for  each  task  77  £  H2,  x2  =  1. 
This  gives  us  the  constraints  in  Fig.  5.  It  can  be  seen  that 
if  0i  and  02  are  given  and  for  a  matrix  X,  it  holds  that  all 
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Fig.  5:  A  slightly  less  naive  formulation  of  constraints  for  task-to-processor-type  assignment. 


Cl.  V  g  TS(t,0i)  :  (Et-gt  ffdbf*1  (rj,  t,  0i,  s,  t)  x  xj^j  <  ((mi  —  (mi  —  1)  x  9i)  x  (  x  s) 

C2.  V  G  TS  (r,  02)  :  (j2neT  ffdbf*2  (n,t,  92,  s,  r)  x  x?j  <  ((m2  -  (m2  -  1)  x  S2)  x  (  x  s) 

C3.  Vri  gr:x?+i?  =  1 
C4.  Vri  G  r  :  x}  G  (0, 1}  and  x?  G  (0, 1} 

C5.  Vri  G  HI  :  xl  =  1 
C6.  Vn  G  772  :  x?  =  1 
C7.  7712  =  0 

C8.  feTieT(Ci /r<)  x  xi)  <  ((mi  -  (mi  -  1)  x  ^1 )  x  s) 

C9.  (j2T.eT(Cf  /Ti)  x  xfj  <  ((m2  -  (m2  -  1)  x  02)  x  s) 


Fig.  6:  A  formulation  of  constraints  for  task-to-processor-type  assignment. 


Cl.  Vi  G  TS  (r,  0i )  :  (f2T.er  ffdbf*1  (rj,  t,  0i ,  s,t)  x  xj)  <  ((mi  —  (mi  —  l)x0i)xixsx  1/2) 

C2.  Vt  G  TS  (r,  02)  :  (ETier  ffdbf*2  (rj,  t,  02,  s,  r)  x  xf\  <  ((m2  —  (m2  -  1)  x  02)  x  i  x  s  x  1/2) 

C3.  Vri  £  r  :  x|  +  x?  =  1 

C4.  Vri  £  r  :  x}  >  0  and  x?  >  0 

C5.  Vri  S  771  :  =  1 

C6.  Vri  S  772  :  x?  =  1 

C7.  77  12  =  0 

C8.  fErigrtC'/h)  x  xl')  <  ((mi  -  (mi  -  1)  x  B{)  x  s  x  1/2) 

C9.  (Erier(^/Ti)  x  x?J  <  ((m2  -  (m2  -1)x92)xsx  1/2) 


Fig.  7:  A  formulation  of  constraints  for  task-to-processor-type  assignment  —  relaxed  to  LP. 


Cl.  Vt  G  TS  (r,  9i)  :  gr  ffdbf* 1  (ri,t,  9i,  s,t)  x  xj^\  <  ((mi  —  (mi  —  1)  x  0i)  x  t  x  s) 

C2.  Vt  g  TS  (t,  92)  :  ^Erigr  ffdbf* ”  (r,,  t,02,  s,  r)  x  x?J  <  ((m2  -  (m2  —  1)  x  02)  x  i  x  s) 

C3.  Vri  g  r  :  xl  +  x?  =  1 

C4.  Vri  g  r  :  xj  g  (0, 1}  and  x?  G  (0, 1} 

C5.  Vri  G  771  :  xl  =  1 

C6.  Vri  G  772  :  x?  =  1 

C7.  7712  =  0 

C8-  (Erigr  (Ci/Ti)  X  xi)  <  ((™1  ~  (ml  -  1)  X  01 )  X  s) 

C9.  (j2TieACi /Ti)  x  <  ((m2  -  (m2  -  1)  x  02)  x  s) 

CIO.  Vri  G  r  s.t.  f  fx'|  =  l)  V  fx'2  =  l))  '■  x\  =  x'\ 

Cll.  Vri  G  r  s.t.  f(x'|  =  1J  V  fx'f  =  1JJ  :  x?  =  x'f 

C12.  X'  is  the  solution  to  the  problem  in  Fig.  7 


Fig.  8:  A  formulation  of  constraints  for  task-to-processor-type  assignment;  we  will  show  that  this  can  be  computed  in  pseudo¬ 
polynomial  time. 


constraints  in  Fig.  5  are  satisfied  then  all  constraints  in  Fig.  4 
are  satis hed  as  well. 

Note  that  there  is  still  a  Vt  in  Cl  and  C2  in  Fig.  5.  We  will 
now  present  a  set  of  constraints  where  we  only  check  a  finite 
number  of  t  —  see  Fig.  6. 

Lemma  6.  if  X  satisfies  Fig.  6  then  X  satisfies  Fig.  5. 


Proof:  Suppose  that  the  lemma  was  false.  Then  there 
exists  t,  II,  0i,  02, -A  such  that  X  satisfies  Fig.  6  and  X  does 
not  satisfy  Fig.  5.  Note  that  it  can  only  be  Cl  or  C2  (or  both) 
in  Fig.  5  that  are  violated. 

If  it  is  Cl  then  we  can  reason  as  follows:  There  must  be  a 
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t  that  violated  Cl  in  Fig.  5.  Hence, 


(m i  —  (mi  —  1)  x  6\)  x  t  x  s 


*  1 

Lemma  5  yields:  ffdbf  (t*,  i0,  0i,  s,  t)  = 

ffdbf1  (tj,  fi,  0i,  s).  Combining  this  with  Eq.  (14),  Eq.  (17), 
and  Eq.  (18)  yields: 

E  ffdbfl  (7»>i iA>s)  x  £;  <  E  ffdbfl  x 

^  ^  TiGr 


Let  us  explore  three  possibilities: 

Case  1:  t  >  mi(r).  Note  that  mi(r)  is  an  element  in 
TS  This  gives  us  from  Fig.  6: 

^E  (ri,mi(r),6»i,s,r)  x  x]^  < 

(mi  —  (mi  —  1)  x  6\)  x  rni(r)  x  s  (15) 
Because  of  C8  in  Fig.  6,  it  also  holds  that: 

^^(Cj/T,;)  x  a;1^  <  (mi  -  (mi  -  1)  x  0i)  x  s  (16) 

Multiplying  Eq.  (16)  by  (t  —  mi(r))  and  adding  to  Eq.  (15) 
and  then  combining  with  Eq.  14  yields: 

(ffdbf*1  (n,mi(T),9i,s,T)  +  ^  x  (t  -  mi(r)))  x  x]  j 

<  (  E  ffdbfl  (a,  s)  x 


VTiGr 


Since  TS(t,6i),  we  can  apply  Lemma  5  on  the  left-most 
term.  Doing  so  and  then  applying  Lemma  3  yields: 

(  E  (ffdbf1  (Tj,mi(T),01,s)  +  C'i  +  ^  x  (t  -  mi(r)))  x  x\ 
Kner  1 

<  |  E  ffdbf1  (■ n,t ,  0i,  s)  x  a:1 

\TiGr 


*1 

Note  that  the  left-hand  side  is  the  expression  of  ffdbf  (the 
second  case  of  Eq.  9).  Hence: 

E  ffdbf*  (n,t,9i,  s,t)  x  x]  <  J2  ffdbf1  (Tj,i,0i,s)  x  a:1 

TiGr  t^Gt 

But  this  contradicts  Lemma  4. 

Case  2:  t  <  DMIN  x  (1  —  0i).  For  such  a  t,  it  holds  that 
ffdbf1  ( n,t,9i,s )  is  zero.  But  this  violates  Eq.  14. 

Case  3:  DMIN  x  (1  —  0i)  <  t  <  mi(r).  Let  us  define  t\ 
as  t\  =  2Llo«2fJ+1  and  let  to  be  ti/2.  It  is  easy  to  see  that 
io  <  t  <  t\.  Note  that  to  is  an  element  in  TS  (t,9 i).  This 
gives  us  from  Fig.  6: 

(E  ffdbf*1  (n,t0,9i,s,T)  x  a:1^  < 

(mi  —  (mi  —  1)  x  $i)  x  to  x  s  (17) 

Since  to  <  t  it  clearly  holds  that: 


Using  this  and  t  <t\  and  Lemma  1  yields  a  contradiction. 

If  C2  is  violated  then  we  can  reason  analogously  to  the  case 
when  Cl  is  violated. 

It  can  be  seen  that  if  the  lemma  is  false  then  each  case 
results  in  contradiction.  Hence,  the  lemma  is  true.  ■ 

Note  that  in  Fig.  6,  there  is  a  finite  number  of  constraints  and 
this  is  what  we  want.  However,  the  X  variables  are  integers 
and  this  makes  the  problem  a  Mixed-Integer  Linear  Program 
(MILP);  the  research  literature  currently  neither  offers  a 
polynomial  time  algorithm  for  solving  general  MILP  nor  for 
solving  MILP  with  the  special  structure  of  Fig.  6.  For  Linear 
Programming  (LP),  polynomial  time  algorithms  are  known 
though  (see  [41],  for  example).  We  will  now  discuss  how  to 
exploit  this.  Fig.  7  shows  an  LP;  it  differs  from  Fig.  6  in  that 
X  variables  are  real  numbers  instead  of  integers  and  it  is  also 
more  constrained  —  s/2  instead  of  s  in  C1,C2,C8,C9.  With 
the  solution  to  this  LP,  we  obtain  a  new  optimization  problem 
—  see  Fig.  8.  This  optimization  problem  is  as  follows.  First, 
we  solve  the  LP  (specified  by  Fig.  7)  and  obtain  a  solution 
X' .  With  this  solution  X' ,  we  consider  the  MILP  (specified 
by  Fig.  6)  and  require  that  for  those  i  such  that  x'\  =  1  or 
x'i  —  1,  the  value  of  xj  should  be  equal  to  x'\  and  the  value 
of  x\  should  be  equal  to  x' i.  There  may  be  some  i:  s  that 
remain;  these  will  be  assigned  values  by  solving  a  MILP  (as 
specified  by  Fig.  8). 

Lemma  7.  If  0  \  and  02  are  given  and  X  satisfies  Fig.  8  then 
satisfies  Fig.  4. 

Proof:  Follows  from  the  discussion  in  this  subsection.  ■ 
Hence,  solving  Fig.  8  yields  an  assignment  of  tasks  to 
processor  types. 

B.  Stating  the  new  algorithm 

We  let  solvePTMILP(T,  n,  9i,  62)  denote  a  function  which 
takes  as  input  a  taskset  r  and  a  computer  platform  n  and  9i 
and  02  and  returns  a  tuple  (/,  X)  where  /  is  a  boolean  and 
X  is  a  matrix  with  the  following  interpretation:  if  Fig.  8  is 
feasible  then  /  is  true  and  X  is  the  solution;  if  Fig.  8  is 
infeasible  then  /  is  false  and  X  is  undefined. 

Algorithm  1  lists  the  pseudo-code  for  evaluating  the  func¬ 
tion  solvePTMILP (r,  n,  9X,  02). 

Definition  8. 

i?.(II)  =  4  +  max  ( 1 - — .  1  ] 

V  m  1  '  m2  J 


Algorithm  2  shows  how  the  assignment  of  tasks  to  processor 
types  works. 


(mi  —  (mi  —  1)  x  9i)  x  to  x  s  < 
(mi  —  (mi  —  1)  x  9i)  x  t  x  s 


Theorem  1.  If  (f,X)  =  solvePTMILP (t,  n,  6>i,  02)  and  f 
(18)  is  true  and  tasks  are  assigned  to  processor  types  according 


6 


Algorithm  1:  An  algorithm  for  evaluating  the  function 

solvePTMILP(T,n,0i,02). 


Input  :  A  taskset  r  and  a  two-type  platform  II  and  6\  and  62 
Output:  A  tuple  (f,X)  where  /  is  a  boolean  and  X  is  a  matrix 
if  H 12  =  0  then 

Solve  the  LP  in  Fig.  7  and  obtain  a  vertex  solution 
if  this  LP  is  feasible  then 

Let  X'  denote  this  solution. 

Let  F  denote  a  set  of  indices  of  tasks  in  L  such  that 

0'i  f  1)  V  (x'l  f  1). 

Let  us  introduce  ^found  which  is  an  assignment  of  values  to 
the  Xi  -variables  whose  subscript  index  is  in  F;  this 
assignment  is  initialized  to  be  undefined. 

Let  us  introduce  a  local  variable  foundPTMILP  that  is 
boolean  and  initialize  it  to  false. 

foreach  assignment  of  0-1  to  the  xi-variables  whose  subscript 
index  is  in  F  do 

Evaluate  if  Fig.  8  is  satisfied  for  this  assignment 
if  the  above  evaluation  yields  true  then 

Let  x  denote  the  assignment  of  0-1  to  the 
x i  -variables  whose  subscript  index  is  in  F 
if  foundPTMILP  =  false  then 
Set  foundPTMILP  to  true 


Set  xfound  to  % 


end 

end 

end 

if  foundPTMILP  then 

Form  the  matrix  X  as  follows: 

For  each  i  E  F:  Assign  x\  and  x ?  according  to 

vfound 
X. 

For  each  i  E  L\F:  Assign  xj  and  x?  according  to  X' . 
For  each  i  £  HI:  Assign  x J  =  1  and  x?  =  0 
For  each  i  E  H2:  Assign  x\  =  0  and  x%  =  1 
return  (true,  X) 

return  (false,  X') 


else 


end 


else 


return  (false,  X),  where  X  is  undefined. 


end 


else 


end 


return  (false,  X),  where  X  is  undefined. 


Algorithm  2:  The  new  intra-migrative  task  assignment 
algorithm  for  two-type  heterogeneous  multiprocessors. 
Input  :  A  taskset  r  and  a  two-type  platform  II 

Output:  An  assignment  of  tasks  to  processor  types  indicated  by  matrix 
X 

1  (/,  X)  :=  solvePTMILP (r,  II,  1/R(II),  1/R(II)) 

2  if  (/  =  true)  then 

3  |  declare  SUCCESS  and  stop 

4  else 

5  |  declare  FAILURE  and  stop 

6  end 


to  the  X-matrix  and  tasks  are  scheduled  with  gEDF  on  each 
processor  type  then  all  deadlines  will  be  met  at  run-time. 


all  deadlines  will  be  met  at  run-time. 

Proof:  Follows  from  Theorem  1.  ■ 

C.  Proving  the  time  complexity  of  the  new  algorithm 
Lemma  8.  |TS(r, 0r)|  =  Llog2  DMiNxa-ec-l  +  2  and 
|TS(r,02)|  =  Llog2  DMINx(i_e2)J  +2 

Proof:  Follows  from  the  definition  of  TS  —  see  Defini¬ 
tion  7.  ■ 

Lemma  9.  After  line  5  of  Algorithm  1  has  executed,  it  holds 

that:  |F |  <  [log2  dminx  (i-0i)  J  +  L^°S2  dminx(i-02)J 

Proof:  In  the  LP,  solved  on  line  2  of  Algo¬ 
rithm  1,  there  are  2  x  L  variables  and  there  are 
L1o§2  dminx(i— 0i)J  +  L1oS2  dminx(i-02)J  +  6  +  |L|  con¬ 
straints  ([log2  DMiNxfi-flr)  J  2  constraints  of  Cl;  the  same 
number  of  C2;  one  constraint  of  C8;  one  constraint  of 
C9;  \L\  constraints  of  C3).  Let  X'  denote  the  solution  of 
the  LP  in  Fig.  7.  Since  for  each  vertex  solution  of  LP, 
it  holds  that  the  number  of  non-zero  variables  is  at  most 
the  number  of  constraints,  it  follows  that  there  are  at  most 

L1o§2  DMtNxa— 8l)J  +  L!og2  dminx(uU)J  +  6  +  \L\  ™n-zero 
values  of  the  X'  variables.  For  each  task  in  L\F,  it  holds 
that  there  is  exactly  one  non-zero  value  in  X' .  For  each  task 
in  F,  it  holds  that  there  is  exactly  two  non-zero  values  in  X' . 
Hence,  there  are  lx  (|L|  —  |F|)  +  2  x  \F\  non-zero  values  of 
the  X'  variables.  Consequently:  1  x  (|L|  —  |F|)  +  2  x  \F\  < 

U°g2  dminx (~i— ) -I  +  L!og2  DMiNxa-g2)J  +6+W-  Rewriting 

yields:  \F\  <  [log2  dmin>^(i=07)  J  +  II0g2  dminx(i-02)  J 

This  states  the  lemma.  ■ 

Lemma  10.  The  number  of  iterations  of  the  for- loop  on  line  8 
of  Algorithm  1  is  at  most:  (™ax±dmax)2  x 

Proof:  The  number  of  iterations  is  at  most  2'F\.  Using 
Lemma  9  yields  that  the  number  of  iterations  is  at  most: 

2L1oS2  DMIN  ‘xa-o^J  +  Poga  DMINx\  r=^yJ+6 


Observing  that  mi(r)  <  2  x  (TMAX  +  DMAX)  and  rewriting 
yields  the  lemma.  ■ 


Lemma  11.  The  time  complexity  of  Algorithm  1  is 

n  (noli,  4-  (  tmax+dmax)  2  _  i  S 

c ' \poiy  +  {  dmin  I  x  (i-e1)x(i-e2) )■ 


Proof:  Follows  from  the  facts  that  (i)  linear  programs 
can  be  solved  in  polynomial  time  [41]  and  hence  line  2  of 
Algorithm  1  can  be  performed  in  polynomial  time  and  (ii)  the 


for  number  of  combinations  iterated  through  in  the  for-loop  of 
line  8  is  at  most  (™ax±dmax)2  x  (Follows 

from  Lemma  10.)  ■ 


Proof:  Follows  from  Lemma  7  and  the  fact  that  Eq.  (6) 
is  a  schedulability  test.  ■ 

Theorem  2.  If  Algorithm  2  declares  success  and  tasks  are 
assigned  to  processor  types  according  to  the  X-matrix  and 
tasks  are  scheduled  with  gEDF  on  each  processor  type  then 


Theorem  3.  The  time  complexity  of  Algorithm  2  is 

o(poi,j+{™^ rAK/)- 

Proof:  Since  Algorithm  2  calls  solvePTMILP  once 
with  0i  =  02  =  l/i?(n)  it  follows  that  (us¬ 
ing  Lemma  11)  the  time  complexity  of  Algorithm  2  is 
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O  { nol'ii  4-  (  TMAX+DMAX  \ 2  y  _ l _ \  qu_ 

vypuiy- 1-  ^  DMIN  )  *  (l-l/R(Il))x(l~l/B(Il))  )■ 

serving  that  4  <  R(H)  yields  that  the  time  complexity  of 
Algorithm  2  is  O  (poly  +  (™Ap^gJMAX)2)  •  ■ 


D.  Proving  the  speedup  factor  of  the  new  algorithm 

We  will  start  by  discussing  necessary  conditions  for  intra- 
migrative  feasibility  and  then  to  prove  the  speedup  factor. 

Lemma  12.  Consider  a  taskset  r  and  a  computer  platform 
II.  If  t  is  intra-migrative  feasible  on  II  then  there  exists  a 
matrix  X  such  that  all  constraints  in  Fig.  9  are  satisfied. 

Proof:  Follows  from  the  fact  that  Eq.  (7)  is  a  necessary 
condition  for  feasibility.  ■ 


tion  8)  yields: 

s  „  ,  mi  —  1 .  s 

"*■  * 1  x  Rm x  2  - (mi  -  isr1  2 

s  „  .  m.2  —  1 ,  s 

m2  x  t  x  — - — -  x  2  <  (mo - )  X  t  X  - 

R( n)  - v  R(n)  ;  2 

s  mi  - 1  s 

mt  x  «(S) s  (mi~  im)  x  2 

s  ,  ,  m2  —  1 .  s 

“  R(U)  ~  y  R{ n)  ;  2 

Hence,  if  X  satisfies  Fig.  11  then  it  also  satisfies  Fig.  12. 
Combining  this  with  Lemma  17  yields  the  lemma.  ■ 


(19) 

(20) 
(21) 
(22) 


Lemma  19.  Consider  a  taskset  r  and  a  computer  platform  n. 
If  there  exists  a  matrix  X  such  that  all  constraints  in  Fig.  12 
are  satisfied  then  Algorithm  2  declares  SUCCESS. 


Lemma  13.  Consider  a  taskset  r  and  a  computer  platform  n. 
If  r  is  intra-migrative  feasible  on  IIx  l/i?(n)  then  there  exists 
a  matrix  X  such  that  all  constraints  in  Fig.  10  are  satisfied. 

Proof:  Follows  from  applying  Lemma  12  on  n  x 
(l/i?(n))  and  then  considering  t  — >  oo  on  Cl  and  C2  yields 
C8  and  C9  respectively.  ■ 

Lemma  14.  VQ  >  0  : 

(Vf  >  0  :  (X^rier  ffdbf1  (n,t,9 i,s))  <  m  x  t  x  Q) 

=$■ 

(Vf  G  TS(t,9i)  :  (j2TieT  ffdbf*1  (n,  t,  Ox,  s,  t)J  <  m  x  t  x 
0x2) 

Proof:  See  Appendix.  ■ 

Lemma  15.  VQ  >  0  : 

(Vf  >  0  :  (Z^Tier  ffdbf2  ( Ti,t,92,s ))  <  m  x  t  x  Q) 

=> 

(Vf  G  TS(t,  9 f)  ■■  ^TiSr  ffdbf*“  (n,  t,  02,  s,t)^  <mxtx 
0x2) 

Proof:  See  Appendix.  ■ 

Lemma  16.  Consider  a  taskset  r  and  a  computer  platform 
n.  If  X  satisfies  Fig.  10  then  X  satisfies  Fig.  11. 

Proof:  Follows  from  applying  Lemma  14  on  Cl  in  Fig.  10 
and  applying  Lemma  15  on  C2  in  Fig.  10.  ■ 

Lemma  17.  Consider  a  taskset  r  and  a  computer  platform  n. 
If  r  is  intra-migrative  feasible  on  nx  l/i?(n)  then  there  exists 
a  matrix  X  such  that  all  constraints  in  Fig.  11  are  satisfied. 

Proof:  Follows  Lemma  13  and  Lemma  16.  ■ 

Lemma  18.  Consider  a  taskset  r  and  a  computer  platform  n. 
If  r  is  intra-migrative  feasible  on  nx  l/i?(n)  then  there  exists 
a  matrix  X  such  that  all  constraints  in  Fig.  12  are  satisfied. 

Proof:  Algebraic  manipulations  of  IK  II)  (from  Defini¬ 


Proof:  Let  us  suppose  that  the  lemma  was  false.  Then 
there  is  a  taskset  r  and  a  computer  platform  n  such  that 

there  exists  a  matrix  X  such  that  all  constraints  in 

Fig.  12  are  satisfied  (23) 

and  Algorithm  2  declares  FAILURE. 

Relaxing  C4  in  Fig.  12  yields: 

there  exists  a  matrix  X  such  that  all  constraints  in 

Fig.  13  are  satisfied  (24) 

Eq.  (23)  yields 

H12(l/i?(n),l/i?(n),r)  =0  (25) 

Consider  Fig.  7  with  9\  =  92  =  1/R  (n)  and  compare  with 
Fig.  13.  They  are  identical.  Hence: 

there  is  a  matrix  X  such  that  all  constraints 
in  Fig.  7  are  satisfied  for  9\=  92  —  l/i?.(n)  (26) 

From  the  statement  that  Algorithm  2  declares  FAIL¬ 
URE  it  follows  that  Algorithm  1  fails  for  the  input 
r,  n,  l/i?(n),  1/I?(n).  Since  it  fails,  let  us  explore  the  possi¬ 
ble  lines  at  which  it  can  fail. 

Case  1.  Algorithm  declares  FAILURE  on  line  32. 

Te  condition  of  the  case  yields  H12(l/i?  (n) ,  1/R  (n) ,  r) 

0.  But  this  contradicts  Eq.  (25). 

Case  2.  Algorithm  declares  FAILURE  on  line  29. 

From  the  condition  of  the  case,  it  follows  that 

there  exists  no  matrix  X  such  that  all  constraints 
in  Fig.  7  are  satisfied  for  9\  =  92  =  l/i?(n) 

But  this  contradicts  Eq.  (26). 

Case  3.  Algorithm  declares  FAILURE  on  line  26. 

From  the  condition  of  the  case,  it  follows  that  foundPTMILP 
is  false  when  the  algorithm  declares  FAILURE  on  line  26.  Let 
us  partition  r  into  F  and  t\F.  Note  that  for  t\F  it  holds  that 
X '  satisfies  Fig.  7  and  since  this  set  of  tasks  have  xj  and  xf 


Cl. 

Vt  >  0  :  feri6T  ffdbf1  (t;,  t,  1/R  (II)  ,  s)  X  xj) 

<  mi  x  t  x  (s/i?(Il)) 

C2. 

Vt  >  0  :  (Eri6T  ffdbf2  (t;,  t,  1/1?  (II)  ,  s)  X  x2J 

<  m2  x  t  x  (s/i?(Il)) 

C3. 

Vt;  S  t  :  x  j  +  x?  =  1 

C4. 

Vt;  £  t  :  4  £  (0, 1}  and  xj  G  (0, 1} 

Vt;  6  Hl(l /R  (II) ,  1/R  (n)  ,  t)  :  xj  =  1 

Vt;  G  H2(l/R  (n) ,  1/R  (II) ,  t)  :  xj  =  1 

C5. 

C6. 

Cl. 

H12(l/l?(n),l/l?(n),T)  =  0 

C8. 

E neACl/Ti)  X  xj  <  mi  x  (s/1? (H)) 

C9. 

E TieACf/Ti)  x  xj  <  m2  x  (s/1?  (n)) 

Fig.  10:  Constraints  expressing  a  necessary  intra-migrative  feasibility  condition;  rewritten. 


Cl. 

Vt  GTS(t,1/R(II))  :  fev-ier  ffdbf*1  (t;,  t,  1/R  (n) ,  s,  t)  x  xll 

i  <  mi  x  tx  ( s/R(Il ))  x  2 

C2. 

Vt  G  TS(t,  1/R  (n))  :  (Et.  Gr  ffdbf*2  (t;,  t,  1/1?  (II) ,  s,  t)  x  x2J 

i  <  m2  x  t  x  ( s/R(Il ))  x  2 

C3. 

Vt;  G  T  :  xl  +  x2  =  1 

Vt;  G  t  \  xj  C  (0, 1}  and  xj  G  (0, 1} 

C4. 

C5. 

Vt;  G  Hl(l/R  (II) ,  1/R  (n) ,  t)  :  xj  =  1 

Vt;  G  H2(l/R  (n) ,  1/R  (n) ,  t)  :  xj  =  1 

C6. 

Cl. 

H12(l/R(n),l/R(n),T)  =  0 

C8. 

E^e.^/T;)  x  xj  <  mi  x  (s/R(n)) 

C9. 

E T,er(C?/Ti)  xxj<  m2  x  (s/R(n)) 

Fig.  11:  Constraints  expressing  a  necessary  intra-migrative  feasibility  condition;  rewritten  further. 


Cl.  Vt  S  TS(t,  1/R  (II))  :  (ETiGT  ffdbf*1  (Ti,t,l/R(U),s,T)  x  xj)  <  (mi  -  (mi  -  1)  x  1/K  (H))  xlxsxl/2 

C2.  Vt  S  TS{t,1/R(U))  :  (E^gt  ffdbf*2  (ri,t,l/R(n),s,r)  x  xj)  <  (m2  -  ( m2  -  1)  x  1/R  (H))  xlxsxl/2 

C3.  Vt;  G  T  :  x|  +  X?  =  1 
C4.  Vt;  6r:i|e  (0,  1}  and  xj  G  (0, 1} 

C5.  Vt;  G  Hl(l/R(n),l/R(n),r)  :  xj  =  1 
C6.  Vr;  6  H2(l/R  (II) ,  1/R  (II)  ,r):ij  s?l 
C7.  H12(l/R(II),l/R(n),T)  =  0 

C8.  Er  eACi/Ti)  X  4  <  ((™i  -  (“1  -  1)  X  1/R  (IT))  x  s  x  1/2) 

C9.  E r'.eACf/Ti)  xxj<  ((m2  -  (m2  -  1)  x  1/R  (II))  x  s  x  1/2) 


Fig.  12:  Constraints  expressing  a  necessary  intra-migrative  feasibility  condition;  rewritten  even  more. 


Cl.  Vt  S  TS(t,  1/R  (II))  :  (EriGrffdbf*1  (t;,  t,  1/1?  (II) ,  s,  t)  X  xf)  <  (mi  -  (mi  -  1)  X  1/1?  (II))  xixsxl/2 

C2.  Vt  S  TS(t,  1/1?  (II))  :  (EriGTffdbf*2  (Ti,t,l/R(U),s,T)  x  xj\  <  (m2  -  (m2  -  1)  x  1/1?  (H))  xixsxl/2 

C3.  Vt;  G  t  :  xj  +  x2  =  1 

C4.  Vt;  6  T  :  xj  >  0  and  x?  >  0 

C5.  Vt;  6  Hl(l/1?  (II) ,  1/R  (II) ,  t)  :  xl  4=  1 

C6.  Vt;  6  H2(l/R  (II) ,  1/1?  (II) ,  t)  :  X;  =  1 

C7.  Hi2(i/i?(n),i/i?(n),T)  =  0 

C8.  Er.6 rifll/Ti)  X  xl  <  (K  -  (mi  -  1)  x  1/1? (n))  x  S  x  1/2) 

C9.  E  rleT(Cf/T;)  x  x?  <  ((m2  -  (m2  -  1)  x  1/R  (II))  x  s  x  1/2) 


Fig.  13:  Constraints  expressing  a  necessary  intra-migrative  feasibility  condition;  rewritten  to  LP. 


being  integers  (follows  from  the  fact  that  it  does  not  contain  the  tasks  in  F),  it  follows  that  X'  also  satisfies  the  following 
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constraints:  Fig.  8  where  in  the  expression  on  the  right-hand 
side  of  C1,C2,C8,C9,  the  symbol  s  is  replaced  by  s/2.  Note 
that  F  C  t  and  hence  from  Eq.  (23),  it  follows  that  for  F,  there 
is  an  X  that  satisfies  the  following  constraints:  Fig.  8  where 
in  the  expression  on  the  right-hand  side  of  C1,C2,C8,C9,  the 
symbol  s  is  replaced  by  s/2.  Adding  X'  and  X  gives  us  a  new 
matrix  that  satisfies  Fig.  8  and  this  yields  that  foundPTMILP 
is  true.  This  is  a  contradiction. 

It  can  be  seen  that  if  the  lemma  is  false  then  each  case 
results  in  contradiction.  Hence,  the  lemma  is  true.  ■ 

Theorem  4.  Consider  a  taskset  r  and  a  computer  platform 
n.  if  r  is  intra-migrative  feasible  on  n  x  l/f?(n)  then 
Algorithm  2  declares  SUCCESS. 

Proof:  Follows  from  Lemma  18  and  Lemma  19.  ■ 

Hence,  the  speedup  factor  of  Algorithm  2  is  f?(n). 

V.  Conclusions 

The  problem  of  scheduling  real-time  tasks  on  a  heteroge¬ 
neous  multiprocessor  has  received  increasing  attention  from 
researchers  during  recent  years  but  no  solution  was  available 
for  parallel  tasks  with  proven  speedup  factor.  Therefore,  in 
this  paper,  we  have  presented  the  first  algorithm  for  scheduling 
parallel  tasks  on  a  heterogeneous  multiprocessor  with  proven 
speedup  factor.  We  did  so  by  focusing  on  constrained-deadline 
sporadic  tasks  and  a  heterogeneous  multiprocessor  where  pro¬ 
cessors  are  of  two  types  and  we  presented  a  new  algorithm  that 
assigns  tasks  to  processor  types  and  then  apply  global-Earliest- 
Deadline-First  on  each  type  of  processors.  Our  new  algorithm 
has  pseudo-polynomial  time  complexity  and  speedup  factor  at 
most  5. 
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Appendix 

A.  Proof  of  Lemma  4 

In  this  section,  we  prove  Lemma  4  and  we  do  this  incre¬ 
mentally,  i.e.,  by  proving  some  basic  results  and  then  merging 
them  to  obtain  the  desired  result. 


Lemma  20.  Vt  >  mi(r)  :  ffdbf  (ty,  t,  v,  s)  < 
ffdbf  (t*,  mi(r),  v,  s )  +  [(Ji  +  ^  x  (t  —  mi(r))^ 

Proof:  Algebraic  manipulations  yield: 

t  =  mi(r)  +  (t  —  nii(r)) 
t  —  mi(r) 


=  mi(r)  + 

<  mi(r)  + 

<  mi(r)  + 


Ti 

t  —  mi(r) 

Ti 

t  —  mi(r) 
Ti 


X  Ti  +  (t  —  mi(r))  mod  T) 
xTi+Ti 
+  1  )  x  Ti 


Using  this  on  Lemma  1  yields: 

ffdbf  ( n ,  t ,  v,  s)  < 


ffdbf  mi(r)  + 

Using  Lemma  2  yields: 

ffdbf  ( Ti ,  t ,  v,  s)  < 


t  —  mi(r) 

Ti  . 


+  1  )  x  Ti,v,s 


ffdbf  (rj,  mi(r),  v,  s)  +  ^ 


t  —  mi(r) 


+  1  I  x  Ci 


Relaxing  the  bound  on  the  right-hand  side  and  rewriting  yields: 


ffdbf  ( Ti ,  t ,  v ,  s)  < 

ffdbf  (rj,  mi(r),  u,  s)  +  (t  —  mi(r))  x  — -  +  Ci 

Ti 

This  states  the  lemma.  ■ 

Lemma  21.  \/t  >  0 ,£  <  mi(r)  :  ffdbf  (ri,t,v,s)  < 
ffdbf  (TM2Llo«2tJ+i?l;)S) 

Proof:  Follows  from  Lemma  1  (monotonicity)  and  ob¬ 
serving  that  t  <  2Llog2  *1+!.  ■ 

We  now  restate  Lemma  4  and  prove  it. 

Lemma  4.  ffdbf  (n,  t,  v,  s)  <  ffdbf*  (r*,  t,  v,  s,  r) 

Proof:  We  need  to  consider  two  cases. 

Case  1.  t  >  mi(r):  For  this  case  Lemma  20  along  with  the 
definition  of  ffdbf  in  Eq.  (9)  proves  the  lemma. 

Case  2.  t  <  mi(r):  For  this  case  Lemma  21  along  with  the 
definition  of  ffdbf*  in  Eq.  (9)  proves  the  lemma.  ■ 

B.  Proof  of  Lemma  14  and  Lemma  15 

In  this  section,  we  prove  Lemma  14  and  Lemma  15  and  once 
again  we  do  this  incrementally,  i.e.,  by  proving  some  basic 
results  and  then  merging  them  to  obtain  the  desired  result. 

Lemma  22.  Vt  €  (mi(r),  mi(r) +  T,]  :  ffdbf*  ( Tift,v,s,T )  < 
ffdbf  (ri,  t,  v,  s)  +  2  x  Ci 

Proof:  Since  t  >  mi(r)  and  because  of  Lemma  1 
(monotonicity),  we  have: 

ffdbf  (ji,  mi(r),  v,  s)  <  ffdbf  (n,  t,  v,  s)  (27) 

Rewriting  yields: 
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ffdbf  (rj,  mi(r),  v,  s)  +  Ci  H - -  x  Ti  <  ffdbf  ( Ti,t ,  v,  s)  +  2  x  Ci  (28) 

Ti 

From  t  G  (mi(r),mi(r)  +  Ti],  we  obtain  that:  t  —  mi(r)  <  Ti. 
Applying  this  on  Eq.  (28)  yields:  \/t  G  (mi(r),mi(r)  +  Ti]  : 

Q. 

ffdbf  (t»,  mi(r),  v,  s)  +  Ci  H - -  x  (t  —  mi(r))  < 

Ti 

ffdbf  ( Ti,t ,  v,  s)  +  2  x  Ci  (29) 

Using  the  definition  of  ffdbf*  yields:  Mt  G 

(mi(r),mi(r)  +  X]  : 

ffdbf*  (ri ,  t ,  v ,  s,  t)  <  ffdbf  (ri,t,v,s)  +  2  x  Ci 

Hence  the  proof.  ■ 

Lemma  23.  Vi  >  mi(r)  :  ffdbf*  (r*,  t,  v,  s,  r)  < 
ffdbf  v,  s)  +  2  x  Ci 

Proof:  We  prove  this  by  contradiction.  If  the  lemma  was 
false  then  there  exist  a  t  such  that  i  >  mi(r)  and 

ffdbf*  ( Ti ,  t ,  v,  s,  r)  >  ffdbf  ( Ti,t,v,s )  +  2  x  Ci 

If  1  >  mi(r)  +  Ti  then  decreasing  i  by  Ti  decreases  the  left- 
hand  side  and  the  right-hand  side  of  the  above  inequality  by 
the  same  amount  (Ci).  Hence,  we  can  decrease  t  by  Ti  until 
it  holds  that  t  £  (mi(r),mi(r)  +2*].  And  this  gives  us  that 
there  exists  a  i  such  that  t  €  (mi(r),  rni(r)  +  Tf\  and 

ffdbf  (ri,t,  v,  s,  t)  >  ffdbf  (ri,t,  v,s)  +  2  X  Ci 

But  this  contradicts  Lemma  22  and  hence  it  is  not  possible 
that  lemma  under  discussion  is  false.  Hence  the  proof.  ■ 

Lemma  24.  Vi  >  rni(r)  :  ffdbf*  (Ti,  t,  v,  s,  r)  < 
ffdbf  (■ Ti,t ,  v,  s)  x  2 

Proof:  Since  t  >  mi(r),  it  follows  that  t  >  TMAX  + 
DMAX  and  then  it  follows  that: 


Proof:  Suppose  that  the  lemma  was  false.  Then  it  holds 
that  there  exists  a  Q,  v  and  s  such  that  Q>0,u>0,  s>0 
and 


^Vi  >  0  :  {Ti,t,v,s)^j  <mxtx  Q^j 

A 

3 1  £  TS(t,9)  :  ffdbf*  (t*,  t,  v,  s,  r)^  >  m  x  i  x  Q 


Since  there  exists  a  t  in  TS(t,  9)  such  that  the  last  constraint 
is  true,  let  us  choose  one  of  them  and  call  it  to-  This  gives  us: 

^Vt  >  0  :  ffdbf  (ji,  t,  v,  s)^  <  m  x  i  x  Q^j 

A 

^  ffdbf*  (r,,  t0,  v,  s,  r)^  >  m  x  f0  x  Q  x  2  j 


Let  us  consider  two  cases: 

Case  1:  to  >  mi(r).  Applying  Lemma  24  on  the  last 
constraint  in  Eq.  (32)  yields: 

^Vi  s.t.  i  >  0  :  ffdbf  (Ti- t, v ,  s)^  <  to  x  i  x  (f'j 

A 

ffdbf  (r*,  to,  v,  s)  x  2  >  to  x  to  x  Q  x  2^ 

Dividing  the  last  constraint  by  2  yields: 


ffdbf  (Ti,t,  v,  s)  >  2  x  Ci 

Using  Lemma  23  and  Eq.  (30)  yields:  Vi  >  mi(r)  : 


ffdbf*  ( Ti,t ,  v,  s ,  r) 

< 

ffdbf  (Ti,  t,  v,  s)  +  2  x  Ci 

ffdbf  ( Ti,t ,  v,  s ) 

ffdbf  ( Ti ,  t,  v,  s) 

< 

2  x  Ci 

1  + 

ffdbf  (Ti,t,  v,  s) 

< 

2  x  Ci 
+  2  x  Ci 

< 

2 

Rewriting  yields:  Vi  >  mi(r)  : 

ffdbf  (Ti,t,v,s,r)  <  ffdbf  (t; ,  t ,  v,  s)  X  2 

This  states  the  lemma. 

Lemma  14.  VQ  >  0,Vu  >  0,Vs  >  0  : 

(Vi  >  0  :  (XT;€t  ffdbf  (n,  t,  v,  s))  <  m  x  t  x  Q) 


Vi  >  0  :  ^  ffdbf  (Tt,i,u,S)j  <  m  x  t  x  (f'j 
A 

ffdbf  ( Ti ,  t0,  v,  s)  >  m  x  t0  x  Q^j 


This  is  a  contradiction.  End  of  Case  1. 

Case  2:  to  <  mi(r).  Applying  2  x  to  on  the  1st  constraint 
in  Eq.  (32)  and  relaxing  the  last  constraint  yields: 

^  ffdbf  ( Ti ,  2  x  to,  v,  s)^  <  to  x  2  x  i0  x  q'J 
A 

^  ffdbf  ( Ti ,  to,  v,  s,  r)^  >  m  x  i0  x  Q  x  2  j 


(Vi  £  TS(t,  9)  :  (Xr  er  ffdbf*  ( Ti ,  t,  v,  s,  r))  <  mxtxQ  x  Since  to  £  TS(t,  9),  it  follows  from  the  definition  of  ffdbf* 
2)  that  ffdbf*  (r»,  io,  v,  s,  r)  =  ffdbf  (n,  2  x  t0,  v,  s).  Applying 


12 


this  on  the  last  constraint  yields: 


<  m  x  2  x  to  x 


>  m  x  t0  x  Q  x  2 


This  is  a  contradiction.  End  of  Case  2. 

It  can  be  seen  that  if  the  lemma  is  false  then  for  each  case, 
we  obtain  a  contradiction.  Hence,  the  lemma  is  true.  ■ 


Lemma  15.  VQ  >  0,Vs  >  0  : 

(Vt  >  0  :  (^C-riST  ffdbf2  ( Ti,t,92,s ))  <  m  x  t  x  Q) 

(Vf  e  TS(t,  62)  ■  (j2Tier  ffdbf*-  (n,t,  d2,  s)j  <  mxtx 

Qx2) 

Proof:  Analogous  to  the  proof  of  Lemma  14.  ■ 
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